Speaker identification using subband HMMS

نویسندگان

  • Kenichi Yoshida
  • Kazuyuki Takagi
  • Kazuhiko Ozeki
چکیده

This paper is concerned with optimum band splitting and optimum recombination weights in subband HMM-based speaker identication. In the rst experiment , the full frequency band (8kHz) was split into two subbands, and speaker identication rate was measured for various splitting frequencies and recom-bination weights. It was found that subbands 0-2kHz and 2-8kHz with equal recombination weights gave the best identication rate, outperforming a base-line method without band-splitting. In the second experiment, the full-band was split into three sub-bands with various splitting frequencies. Splitting into 0-2kHz, 2-6kHz, and 6-8kHz gave the best result, slightly outperforming the two-subband case. Finally, four-subband experiment was conducted, the result of which suggests that the speaker information and the phonemic information are complementary to a considerable degree in the spectral domain.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving speaker identification in noise by subband processing and decision fusion

We investigate speaker identification in narrowband noise using subband processing. The output of each subband is used to train and test individual hidden Markov models (HMMs), each making a preliminary decision on speaker identity. Subsequently, these are combined to produce a final decision. For sufficient numbers of filters, subband processing outperforms traditional wideband techniques by a...

متن کامل

Noise - Robust Speaker Recognition Using Subband Likelihoods and Reliable - Feature Selection

Sungtak Kim et al. 89 We consider the feature recombination technique in a multiband approach to speaker identification and verification. To overcome the ineffectiveness of conventional feature recombination in broadband noisy environments, we propose a new subband feature recombination which uses subband likelihoods and a subband reliable-feature selection technique with an adaptive noise mode...

متن کامل

Various Methods for Visual Speaker Identification for Automatic Continuous Speech Recognition in TV Broadcast Programs

This paper is about different methods and algorithms that were used for speaker identification from the video recordings of TV broadcast news transcription. The information from visual speaker identification were used in our complex system for automatic continuous speech recognition of TV broadcast programs because it is possible to use speaker adapted (SA) Hidden Markov Models (HMMs) if we hav...

متن کامل

Speaker Identification in Emotional Environments

The performance of speaker identification is almost perfect in the neutral environment. However, the performance is significantly deteriorated in emotional environments. In this work, three different and separate models have been used, tested and compared to identify speakers in each of the neutral and emotional environments (completely two separate environments). Our emotional environments in ...

متن کامل

Subband architecture for automatic speaker recognition

We present an original approach for automatic speaker identification especially applicable to environments which cause partial corruption of the frequency spectrum of the signal. The general principle is to split the whole frequency domain into several subbands on which statistical recognizers are independently applied and then recombined to yield a global score and a global recognition decisio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999